Prediction of protease substrates using sequence and structure features
نویسندگان
چکیده
MOTIVATION Granzyme B (GrB) and caspases cleave specific protein substrates to induce apoptosis in virally infected and neoplastic cells. While substrates for both types of proteases have been determined experimentally, there are many more yet to be discovered in humans and other metazoans. Here, we present a bioinformatics method based on support vector machine (SVM) learning that identifies sequence and structural features important for protease recognition of substrate peptides and then uses these features to predict novel substrates. Our approach can act as a convenient hypothesis generator, guiding future experiments by high-confidence identification of peptide-protein partners. RESULTS The method is benchmarked on the known substrates of both protease types, including our literature-curated GrB substrate set (GrBah). On these benchmark sets, the method outperforms a number of other methods that consider sequence only, predicting at a 0.87 true positive rate (TPR) and a 0.13 false positive rate (FPR) for caspase substrates, and a 0.79 TPR and a 0.21 FPR for GrB substrates. The method is then applied to approximately 25 000 proteins in the human proteome to generate a ranked list of predicted substrates of each protease type. Two of these predictions, AIF-1 and SMN1, were selected for further experimental analysis, and each was validated as a GrB substrate. AVAILABILITY All predictions for both protease types are publically available at http://salilab.org/peptide. A web server is at the same site that allows a user to train new SVM models to make predictions for any protein that recognizes specific oligopeptide ligands.
منابع مشابه
PROSPER: An Integrated Feature-Based Tool for Predicting Protease Substrate Cleavage Sites
The ability to catalytically cleave protein substrates after synthesis is fundamental for all forms of life. Accordingly, site-specific proteolysis is one of the most important post-translational modifications. The key to understanding the physiological role of a protease is to identify its natural substrate(s). Knowledge of the substrate specificity of a protease can dramatically improve our a...
متن کاملA QSAR Study of HIV Protease Inhibitors Using Computational Descriptors to Prediction of pki of Cycle Derivatives of Urea
Preventing and reducing the spread of HIV (HIV) has always been a concern in medical science. One of the most common ways to control the virus is using enzyme-blocking drugs. In this study, we attempted to predict the biological activity (PKi) of organic urea derivatives in protease inhibitor compounds using molecular modeling using QSAR (Quantitative Structure Activity Relation), which is the ...
متن کاملBioinformatic Approaches for Predicting substrates of Proteases
Proteases have central roles in "life and death" processes due to their important ability to catalytically hydrolyze protein substrates, usually altering the function and/or activity of the target in the process. Knowledge of the substrate specificity of a protease should, in theory, dramatically improve the ability to predict target protein substrates. However, experimental identification and ...
متن کاملTHE DESIGN, MODELING AND EVALUATION OF POTENTIAL HIV PROTEASE INHIBITORS USING BLITZ, AN INTERACTIVE COMPUTER GRAPHICS WORKING TOOL
Several nonpeptide small molecules were designed as potential inhibitors of HIV protease and their structures were constructed by computer-aided molecular modeling and docked iwo the active site of HIV protease. Models of the complexes of inhibitors and the HIV protease were refined using nonbonded and H-bonding terms. The refined energy of selected complexes showed that the designed inhib...
متن کاملCloning and Enhanced Expression of an Extracellular Alkaline Protease from a Soil Isolate of Bacillus clausii in Bacillus subtilis
in the detergent industry. In this study, the extracellular alkaline serine protease gene, aprE, from Bacillusclausii was amplified by PCR and further cloned and expressed in B. subtilis WB600 using the pWB980 expression vector. Protease activity of the recombinant B. subtilis WB600 harboring the plasmid pWB980/aprEreached up to 1020 U/ml, approximately 3-folds higher than the nativ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 26 14 شماره
صفحات -
تاریخ انتشار 2010